Exploiting non-quantum entanglement to widen applicability of limited-entanglement 

classical simulations of quantum systems. 
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It is known that if the quantum gates in a proposed quantum computer are so noisy that they 
are incapable of generating entanglement, then the device can be efficiently simulated classically. 
If the measurements and single particle operations are restricted, then the same statement can be 
true for generalised non-quantum notions of entanglement. Here we show that this can improve the 
applicability of limited-entanglement simulation algorithms. In particular, we show that a classical 
simulation algorithm of Harrow & Nielsen can efficiently simulate magic state quantum computers 
that are ideal other than noise on the CNOTs for joint depolarising strengths of 272/489 ~ 56%, in 
contrast to noise levels of 2/3 ~ 66% required if the algorithm uses quantum notions of separability. 
This suggests that quantum entanglement may not be the most appropriate notion of entanglement 
to use when discussing the power of stabilizer based quantum computers. 



INTRODUCTION 

Entanglement is often cited as the main reason why 
quantum devices cannot be efficiently simulated on clas- 
sical computers. However, while it is the case that devices 
with limited entanglement can often be efficiently classi- 
cally simulated [3j-|6j, there can also be highly entangled 
systems that can be efficiently simulated classically [JEl]. 
The most prominent example of this arises in fault tol- 
erant quantum computation. In such architectures the 
stabilizer components generate the quantum entangle- 
ment, but acting alone they can be efficiently simulated 
classically [l|. Universal quantum computation is usu- 
ally achieved by injecting a single qubit state, gate, or 
measurement into the circuit. So fault tolerant quantum 
computation appears to provide a regime where entan- 
glement is not the source of the computational power. 

Motivated by this example, in this paper we will 
present arguments that ordinary quantum entanglement 
may not be the most appropriate notion of entanglement 
to use when discussing the computational power of sta- 
bilizer based computations. We will do this by showing 
that using a generalised notion of entanglement, as dis- 
cussed in we may significantly enhance the range of 
applicability of classical simulation algorithms that rely 
on limited entanglement. In particular we will consider 
applying the classical simulation algorithm of Harrow 
& Nielsen 0] to quantum computers built using stabi- 
lizer components and non-stabilizer single qubit resources 
(the so-called 'magic-state' architectures [7[). If quan- 
tum entanglement is used, the Harrow-Nielsen algorithm 
requires that the CNOT undergo 2/3 ^66% of joint de- 
polarising noise in order to provide an efficient classi- 
cal simulation. However, we will show that in the case 
of magic-state architectures, an alternative notion of en- 
tanglement can improve the range of applicability of the 
algorithm to less than 56% noise on the CNOT. In the 
same spirit as the research programme proposed by Q, 



this suggests that in the context of fault tolerant quan- 
tum computation there may be different notions of en- 
tanglement that are more appropriate when discussing 
computational power. 

Note that our work is distinct from the usual approach 
to analysing noise induced classicality in magic-state 
computers, which treats the non-stabiliser resource as the 
source of non-classicality, and aims to to understand the 
noise levels required before this resource loses its power 
0, @| ■ Here we are treating the single qubit non-stabilizer 
resources as classical, but assessing the power brought by 
the generalised-entangling power of the CNOT. 



GENERALISED ENTANGLEMENT AND 
CLASSICAL SIMULATION 

A quantum state of two particles A and B is said to be 
quantum separable if it can be written as a probabilis- 
tic mixture of products of single particle quantum states 
PA® Pb- In the context of qubits, pA and ps must be 
quantum states drawn from the Bloch sphere, which we 
denote by the symbol Q. This conventional definition 
of entanglement can be generalised for any convex 
set C of single particle Bloch vectors (perhaps including 
ones outside the Bloch sphere) , we define an operator to 
be C-separable if it can be decomposed as a probabilistic 
mixture of product 'states' pa®Pb such that pA, Pb £ C. 

The notion of quantum separability is utilised in the al- 
gorithm developed by Harrow & Nielsen Q to classically 
simulate devices with non-entangling gates. The algo- 
rithm samples the separable decomposition of the system 
after each gate has been applied, and stores the product 
state that results from each such sampling step. The 
final result of the computation is then obtained by sam- 
pling the probabilities of measurement outcomes on each 
individual particle. The algorithm is efficient as it only 
ever stores and manipulates product operators, which is 
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computationally inexpensive. 



NOTATION 



If we allow all single particle measurements, then to 
be able to perform the final sampling step it is essen- 
tial that the variables stored correspond to single parti- 
cle quantum states - otherwise the final sampling will in- 
volve negative probability distributions. However, if the 
measurements in the device are restricted, then the HN 
algorithm will work even if the gates are non-entangling 
w.r.t. any set C that is contained within the normalised 
dual M* of the set of available measurement operators 
M (see e.g. 



We will represent two qubit operators in one of two 
ways, both relying upon the Pauli expansion: 



C C M* := {p\tr{p} = l,tr{pB} > 0,VS G A/}, (1) 



In the case where M is the set of all quantum measure- 
ments, then M* is the set of quantum states. If M is a 
restricted, then M* will be correspondingly lar ger . The 
notion of entanglement changes accordingly [jl llo|. For 
example, if M is the set of Pauli measurements, then Bell 
pairs become separable (see e.g. fllj]). 

[Aside: It is important to note one subtlety: for the 
classical simulation to function with such a modified no- 
tion of entanglement we also require that the single parti- 
cle operations cannot produce single particle states out- 
side C, i.e. that the single particle states can only be 
initialised within C, and C must remain invariant under 
the action of any single particle gates.] 



In [ll| such considerations were applied to understand 
the effects of noise on computation in (among other situ- 
ations) magic state quantum computation, in which sin- 
gle particle unitaries are drawn from the Clifford group 
and the measurements M are in Pauli directions only. 
While this did provide new regimes that could be effi- 
ciently simulated classically, it did not lead to significant 
improvements in the applicability of the HN algorithm, 
in that the levels of noise required to remove the entan- 
gling power of the CNOT were typically no lower than 
those required to remove quantum entanglement. 



However in the set C was taken to be the full dual, 
i.e. C = M *, and this is not required as we can only ever 
initialise the qubits in genuine quantum states. So in 
fact one can consider choices of C such that Q C C C 
M* . In this work we show that in the context of magic 
state architectures there exist choices of C such that Q C 
C C M* that can lead to quite large differences in the 
entangling power of the CNOT (CX) or CSIGN (CZ) 
gate. In particular we find that the CZ gate requires only 
56% depolarising noise to remove its entangling power, 
in contrast to 2/3 = 66% if we choose C = Q. 



i,j=0,..,3 



(2) 



where gq , <j\ , 02 , 03 represent the 2x2 Identity and the 
Pauli X,Y,Z operators respectively. In fact as we will 
only consider trace normalised operators, we will always 
have poo = 1. We will display the coefficients {pij} as a 
4x4 matrix, with rows/columns numbered from 0,..,3: 



I poo = 1 Pol P02 P03 \ 

PlO Pll Pl2 Pl3 

P20 p21 P22 P23 

\ P30 PSl P32 P33 j 



(3) 



or as a column vector formed by sequentially concatenat- 
ing the columns of this matrix, i.e. 

(p00,PW, P20, P30, P01, •-, P31; P02, •-, P32, P03, ■-, P33) T ■ 

(4) 

Please note that we do not represent 2-qubit operators as 
density matrices in the computational basis - we always 
use this Pauli expansion form. 

Although the results can be modified straightforwardly 
to any 2-qubit Clifford gate, the only gate that we will 
consider in this work is the CSIGN or CZ gate, which is 
the unitary |0)(0| <g> I + |1)(1| ® Z. 

We will use the symbol C\ to denote an ideal CZ gate 
followed by joint depolarising noise, where with probabil- 
ity A the output of the gate is replaced with a maximally 
mixed state represented by a normalized identity. 

The sets C that we consider consider here will be 
truncated cubes of Bloch vectors. We have investigated 
a number of sets that are invariant under the singlc- 
qubit Clifford group, and truncated cubes are the best 
of these in the parameter ranges of interest. The trun- 
cated cubes (see fig. {T])) are parameterised by a vari- 
able r G (0, 1] and are defined in the following way: 
TRUN{r) is the truncated cube given by the convex 
hull of Bloch vectors (x,y,z) of the form (\x\, \y\, \z\) = 
(1, 1, r), (1, r, 1), (r, 1, 1), i.e. the convex hull of all possi- 
ble sign choices and choices of either \x\, \y\ or \z\ equal 
to r. There are hence 24 extremal points of TRUN(r). 
For two particles A and B will need to consider prod- 
ucts of states selected from TRUN(r). We will use the 
shorthand {xa,VAi %a) ® {xb,Vb, %b) to denote to such 
states, where (xa, Ua> %a) an d {xbiUb, zb) are the indi- 
vidual Bloch vectors. In particular the maximally mixed 
state can be represented as (0, 0, 0) ® (0, 0, 0). 
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FIG. 1: A truncated cube. 

PROBLEM AND BASIC METHOD 

Our goal is to calculate, for various choices of C = 
TRUN(r), the level of joint depolarising noise required 
to make all output states separable w.r.t. TRUN{r). In 
particular we must go through all possible pairs of in- 
put extremal points, act upon them with C\ and find the 
minimal A required to make sure that all outputs are sep- 
arable. There are 24x24 possible pairs of extremal input 
states, but in fact it is sufficient to consider only four 
of them. The reason for this is that if a given operator 
p is separable w.r.t. TRUN(r), then so is any operator 
obtained from its matrix representation ((3]) by (a) per- 
muting any of the last three columns, (b) permuting any 
of the last three rows, (c) transposition of the matrix, 
(d) multiplying any of the last three rows or columns 
by minus signs. This is because all these operations are 
linear, and leave TRUN(r) invariant. Up to these trans- 
formations the outputs obtained from the full set of pairs 
of extremal input states are equivalent to the outputs of 
only four pairs, which we refer to as Cases 1 to 4: 

Pl := (1, 1, r) ® (1, 1, r ) p 2 := (r, 1, 1) ® (1, 1, r) 
P3 := (1, 1, r) g (1, r, 1) p 4 := (r, 1, 1) ® (1, r, 1) 

Hence our goal is to compute the minimal A required to 
make sure that the outputs C\(pi) are TRUN(r) sepa- 
rable for i = 1,..,4. Consider for instance Case 1. Wc 
must compute: 

min A 

s.t. (1 - X)CZ( Pl ) + A((0, 0, 0) ® (0, 0, 0)) 
G TRUN(r) - sep 

This is a linear programming problem, and hence can be 
readily solved using standard numerical tools. In fact, as 
we shall see it turns out that the structure of the prob- 
lem enables us to use computational algebra packages to 
obtain almost completely analytic solutions. We can re- 



express the linear programme as: 
min po 

s.t. CZ( Pl )= P" E n, (5) 

n=0,..,576 

Pn > (6) 

where 

E :={CZ{ Pl )- (0,0, 0)8(0,0,0)) , po := A 

and for n = 1,..,576 the E n denote pairs of extremal 
product states from TRUN(r), and the p n represent the 
probabilities with which they occur in the separable de- 
composition. We do not need to impose normalisation 
^2 n>0 p n = 1, as this is implicitly required by the other 
conditions. 

A numerical linear programming routine will usually 
return the minimal value of po (i.e. A) as well as the val- 
ues of p n (for n > 0) that achieve it. For our purposes 
it is important to note that the values of p n that achieve 
the optimum can always be chosen to be non-zero only 
on a linearly independent subset of vectors from E n . The 
reason for this that the optimum can always be achieved 
on an extremal point of the convex set of p n that satisfy 
the constraints, and it is well known in linear program- 
ming theory that the extremal points must be non-zero 
only on linearly independent subsets of E n . 

One can find such a linearly independent solution by 
first solving the the programme numerically, and then 
one by one removing the vectors E ni each time checking 
whether the optimal solution can still be attained. If at 
any step the removal of a particular E n prevents us from 
reaching the optimal solution, we put that vector back 
into the problem and carry on removing others. Eventu- 
ally we terminate at an optimal solution involving only 
a linearly independent subset of E n . 

This numerical solution can then be turned into an 
analytic proof that the value of A is achievable. One 
simply takes the symbolic version of the matrix formed 
from the optimal linearly independent subset of E n , and 
then inverts the condition (JS|) to obtain an analytic ex- 
pression for the optimal p n . The solutions obtained in 
this way are not usually simple, as the optimal separable 
decompositions are usually not unique, and so the com- 
puter does not necessarily identify the neatest solution. 
However, for completeness in the appendix we present 
example decompositions for the region around r = 1/2. 

While this demonstrates achievability analytically, it 
does not demonstrate necessity analytically - i.e. that 
the solution is optimal. One approach to attempt to fill 
this gap is to obtain a set of inequalities (the analogue of 
entanglement witnesses) that are necessary conditions for 
any operator to be Ti?£7 JV(r)-separable, and then turn 
these into inequalities on A that could match the achiev- 
able values of A. In the next section we will see that in 



4 



the regime around r = 1/2 (which appears to be the best 
region) we have been able to find suitable inequalities for 
Cases 2-4 above, and hence the optimal A for these cases 
analytically. In case 1 we have not been able to identify 
suitable inequalities. However, putting all four cases to- 
gether for r = 1/2 we find that A = 272/489 - 0.56 is 
analytically achievable, whereas A = 5/9 ~ 0.55 is neces- 
sary. So our analytic solutions are essentially tight. 



RESULTS AND NECESSARY CONDITIONS 
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As the truncated cubes that we consider are within the 
dual of ideal Pauli measurements, it is a necessary con- 
dition for separability that the output operators return 
positive values for the probabilities of measurement out- 
comes for the various Pauli operators. It turns out that 
for values of r or most interest such inequalities are For 
instance, consider Case 4. Under the action of C\ the 
output operator is: 



(7) 



On this operator the requirement of having a positive 
probability of getting down, down in a measurement of 
the X and Y operators leads to: 

A>l-(l/(2 + r 2 )). (8) 

Performing identical computations for Cases 1-3 gives: 

Casesl&3: A > 1 - (1/(1 + 2r)) (9) 
Case2: A > 1 - (1/(2 + r 2 )) (10) 

We will now see that in cases 2-4 these inequalities are 
tight. The numerical results for Cases 1-4 are plotted in 
figure [U For all values of r, Case 4 matches its lower 
bound in equation l[5|). For values of r decreasing from 
r = 1 the other three cases also match the lower bounds 
(|9I10[) . but for each of these cases there is a value of r 
below which these bounds become unattainable. 

We see from these results that r = 1/2 appears to 
require the least noise. This value of r is also large enough 
to include the whole Bloch sphere, and hence all magic- 
states. At the point r = 1/2, Cases 2,3 and 4 attain the 
lower bounds (fTO]) . (|9|) . (|8|) . hence showing that the higher 
of these values 1 — 1/(2 + r 2 ) = 5/9 = 0.5555 is necessary 
and sufficient for these cases. With Case 1 we have not 
been able to identify matching lower bounds, however the 
method of the previous section can be used to show that 
in the vicinity of r = 1/2 we may achieve values of 



FIG. 2: Plot of the minimal noise level A required to make the 
output of a noisy CZ TRUN(r) separable, for each of the four 
cases. For all values of r, Case 4 matches its lower bound in 
equation |[HJ|. For values of r decreasing from r = 1 the other 
three cases also match the lower bounds (ItJIlOf) . but for each 
of these cases there is a value of r below which these bounds 
become unattainable. 



SUMMARY AND OPEN PROBLEMS 

We have shown that using a non-quantum notion of en- 
tanglement can increase the range of application of Har- 
row & Nielsen's limited entanglement simulation method. 
However, we do not know whether this persists for more 
general noise models, or in the context of multiparticlc 
entanglement. 

We also do not know whether the truncated cubes are 



the optimal set for our problem. In future work [12j we 
will present lower bounds on the A required for any set 
C, where for any Q C C C M* we can show that A 
must be greater than 0.42. However, truncated cubes 
are currently the best among the choices of C that we 
have tried. 



In 



121 ] we will also present an analysis of 
a generalised problem where the sets Q and M * are re- 
duced and expanded respectively by noise. In very noisy 
regimes one can identify the optimal sets that require the 
least amount of depolarising noise to make the CZ sepa- 
rable (they turn out to be either spheres or octahedra), 
but for the most interesting case of noise-free Q and M* 
it open whether truncated cubes arc the best choice. 
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A = (4 r 2 + 8 r + 12)/(r 4 + 2 r 3 + 9 r 2 + 16 r + 20). 



At r = 1/2 this evaluates to 272/489 = 0.5562. 
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APPENDIX: COMPUTER GENERATED EXPRESSIONS FOR ACHIEVABLE VALUES OF A NEAR 

r = 1/2. 

In this appendix, for r = 1/2, we present TRUN(r) separable decompositions that show that the numerically 
obtained values of A are achievable. The decompositions have been obtained using the computational methods 
described in the main text. They are extremely messy objects - this may be a reflection of the fact that the optimal 
separable decompositions are usually non-unique. It is possible that other methods may lead to more concise and 
illustrative expressions. Indeed, the solution for Case 4 was guessed by hand, and the expressions are much simpler 
than the remaining cases. 



CASE 1 



For Case 1, and a noise level given by 



A = 1 



r 4 + 2r 3 + 5r 2 + 8r + 8 
,4 + 2 r 3 + 9 r 2 + 16 r + 20 



(11) 



the output state of C\ is given by (using the column vector notation of equation (HJ)) 



/ r 4 +2 r a +9 '^ + 16 r+20 \ 

315 t'-I+F, r^4-R r+S 



r 4 + 2r 3 + 5r 2 + 8r + 8 
" 4 + 2r 3 + 9r 2 + 16r + 20 



,.4 +2 r a +5r 2 +8 

r 
r 
r 
r 
1 

-1 
1 
/• 

-1 
1 
1 
r 
1 
1 



(12) 



The separable decomposition of this state is given by the following expression, where the columns of the left matrix 
correspond to product operator extrema (in the notation of equation (U) ) , and the elements of the right column vector 
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are the associated probabilities. In the vicinity of r 
works for a range of r. 



1/2 all the probabilities are positive, and so the decomposition 
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2 (r 2 -2 r+4) 
-r s -5r 3 +2 r 2 + 12 r+40 



4 (r 2 +r+l) 
(r+I) (r 4 +2 r :i +9 r' 2 + 16 r+20) 



2r 2 (r-1) 

_ r 5_5 r 3_|_2 r 2 + 12r +40 

2 r 2 (r-1) 

_ r 5_ 5 r a +2 r 2 + 12 r+40 



-r 6 +r 5 + 12 r 3 +21 r 2 +27 r-12 
4 (r 5 +3 r 4 + ll r :1 +25 r 2 +36 r+20) 
r t -r 6 +2 r 5 -8 r 4 -9 r 3 +r 2 +54 r+8 
4 (r e +r 5 +5r 4 +3r 3 -14r 2 -52 r-40) 
-r 6 -2 r 4 +6 r 3 -r 2 + 14 r+8 
4 (-r b -5 r a +2 r^ + 12 r+40) 
-r 7 +r 6 -6r 5 + 12 r 4 -llr 3 + ll r 2 +2r+40 
4 (r 6 +r 5 +5 r 4 +3 r 3 -14 r 2 -52 r-40) 

r (-r 5 -2r 3 +6r 2 +5r+16) 
— r 6 +r 5 +5r 4 + 3 r a_ 1 4 r 2_ 52 r _40 

-r 6 -2 r 4 +6 r 3 +5 r 2 + 12 r-8 
(r+1) (r-2) (r 4 +2r 3 +9r 2 + 16r+20) 
2 (r 2 -2r+4) 
_ r 5_5 r 3_|_ 2 r 2 + 12 r+40 



CASE 2 



For Case 2, in the vicinity of r = 1/2, and for a noise level given by 



A = 1 



2 + r 2 



(13) 



the output state of C\ is given by (using the column vector notation of equation 



2 + r 2 



/ 2 + r 2 \ 

r 2 

r 
1 
1 
1 

— r 
1 
1 

-1 
r 
1 
r 
r 
1 
r 



(14) 
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The probabilities p m appearing in the decomposition arc given by: 



3r 17 +9r 16 -44r 15 -121r 14 -92r 13 -49r 12 +692r 11 +807r 1Q + 1458r 9 +305r s + 1020r 7 -419r 6 +772r 5 -371r 4 +252 r 3 -139r 2 +35r-22 
4r(r 2 +2) (r 15 +5 r 44 +2 r 13 + 12 r 12 +47 r 11 -129 r 10 -88 r 9 -122 r 8 + 15 r 7 +99 r 6 +26r5+96 r 4 +r 3 +25 r 2 -4 r+14) 
r 17 +5r 16 +22 r 15 +73 r 14 +98 r 13 +447r 12 -154r 11 +805r 1 °-476?; 9 + 1245 r 8 -766 r 7 + 1131 r 6 -642 r 5 +357 r 4 -126 r 3 +39 r 2 -5 r-6 
4r(r 2 +2) (r 14 +4 r 13 -2 r 12 + 14 r 11 +33 r 1 "- 162 r u +74 r s -196 r 7 +211 r B - 112 r 5 + 138 r 4 -42r 3 +43 r 2 - 18 r+14) 
-r 18 -4r 17 + 17r 16 +87r 15 + 159r 14 +297 r 13 +355 r 12 +419r 11 + 1041r 10 +405 r 9 + 1201 7- 8 -331 r 7 +917 r 6 -661 r 5 +353 r 4 -175 r 3 +60 r 2 -37 r-6 
4r(r 2 +2) (r 15 +5 r 14 +2 r 13 + 12 r 12 +47 r 11 -129 r ll) -88 r ,J -122 r 8 + 15 r 7 +99 r B +26r 5 +96 r 4 +r 3 +25 r 2 -4 r+14) 

-2r 15 -r 14 +3r 13 +60r 12 +72r 11 +233r 10 +293r 9 +406r 8 +38r 7 +289r 6 -259r 5 +48r 4 -108r 3 -9r 2 -37r-2 
2 (r 2 +2) (-r 14 -6 r 13 -8 r 12 -20 r 11 -67 r lu +62 r y + 150 r 8 +272 r 7 +257 r 6 + 158 r 5 + 132 r 4 +36 r 3 +35 r 2 + 10 r+14) 
■r 17 + ll r 16 +35 r +55 r 14 + 125 r 13 + 17r 12 + 175 r 11 +219 r 10 +579 r 9 + 195 r 8 +585 r 7 -235 r 6 +431 r 5 -213 r 4 + 101 r 3 -39 r 2 + 16 r-10 
2 (r 2 +2) (r 15 +5 r 14 +2 r 13 + 12 r 12 +47 r 11 - 129 r lll -88 r !J -122 r s + 15 r 7 +99 r e +26 r 5 +96 r 4 +r 3 +25 r 2 -4r+14) 
-r 16 +3r 15 +58 r 14 + 131r 13 +214 r 12 +487r 11 + 122r 10 +551r 9 -156 r 8 +489 r 7 -178r 6 +361r 5 -54r 4 +45 r 3 -2r 2 -19r-3 

2 fr 2 +2) (r 15 +5 r 14 +2 r 13 + 12 r 12 +47 r 11 - 129 r lll -88 r 9 -122 r 8 + 15 r 7 +99 r 6 +26 r 5 +96 r 4 +r 3 +25 r 2 -4r+14) 
(r+1) (r 15 +3 r 14 + 15 r 13 +52 r 12 + 117 r 41 +49 r 10 +467 r 9 -278 r 8 +675 r 7 -427 r 6 +573 r 5 -336 r 4 + 167 r 3 -73 r 2 +33 r-14) 
4(r 2 +2) (r-1) (r 13 +5 r 12 +3 r 11 + 17 r 10 +50 r 9 - 112 r 8 -38 r 7 -234 r 6 -23 r 5 - 135 r 4 +3 r 3 -39 r 2 +4 r-14) 

2 (V 2 + l) (2r 12 + ll r 11 + 14r 10 +31r 9 +68r 8 +41r 7 + 73r 6 -23r 5 +31r 4 +5r 2 +4r-l) 
~ (r 2 +2) (r+1) (r 13 +5r 12 +3r 1:l + 17 r lll +50r 9 -112r s -38r 7 -234r 6 -23r 5 -135 r 4 +3 r 3 -39 r 2 +4 r- 14) 
(r-1) 2 (-r 13 + 17r 11 +55r 10 +84r 9 +89r 8 + 106r 7 + 146r 6 +63r 5 +142r 4 -27r 3 +55 r 2 + 14r+25) 
2 (r 2 +2) (-r 14 -6r 13 -8r 12 -20r 11 -67r 10 +62r" + 150r 8 +272 r 7 +257r B + 158r 5 + 132r 4 +36r 3 +35r 2 + 10r+14) 
r 16 +6r lj +28r 14 +45r 13 + 101r 12 -108r 11 +826 r 1Q -317 r 9 + 1397r 8 -666r 7 + 1032r 6 -685r 5 +531r 4 -256r 3 + 162 r 2 -67r+18 
4(r 2 +2) (r 14 +4 r 13 -2r 12 + 14r 11 +33r 10 -162 r 9 +74 r s - 196 r 7 +211 r 6 - 112 r 5 + 138 r 4 -42 r 3 +43 r 2 - 18 r+14) 

2r l5 -5 r 14 +9 r 13 -84r 12 +2r 11 -429r ln -23r 9 -444r 8 +94r 7 -143 r 6 -53 r 5 +76 r 4 -34 r 3 +r 2 +3 r+4 
2(r 2 +2) (r 14 +4r 13 -2r 12 + 14r 11 +33r lu -162 r 9 +74 r s - 196 r 7 +211 r 8 - 112 r 5 + 138 r 4 -42 r 3 +43 r 2 - 18 r+14) 
r 16 + 10 r 15 +35r 14 +78r 13 + 181r 12 +200r 11 +191r 1Q -8r 9 + 135r 8 -110r 7 +241r 6 -122r 5 + 183 r 4 -36 r 3 +45 r 2 -12 r+12 
2 (r 2 +2) (-r 14 -6r 13 -8r 12 -20r 11 -67r ll) +62r u + 150r 8 +272 r 7 +257re + 158r 5 + 132 r 4 +36r 3 +35r 2 + 10r+14) 
-r 15 +6r 14 +31r 13 + 100r 12 + 102r 11 +291r 1Q +94r 9 +305r 8 +35r 7 + 156r 6 -149r 5 + 142r 4 -iq4r 3 +27r 2 -8r-3 
2(r 2 +2) (r 14 +4r 13 -2 r 12 + 14r 11 +33r I "-162r u +74r 8 -196r 7 +211r B -112r b + 138r 4 -42r a +43 r 2 - 18 r+14) 
(r 2 + l) (r 14 +9r 13 +29r 12 +67r 1:l + 140r 10 +86r 9 + 180r 8 -6r 7 +77r 6 -119r 5 +65 r 4 -29r 3 +22 r 2 -8r-2) 
~ (r 2 +2) (r-1) (r+1) (r 13 +5 r 12 +3 r 41 + 17 r 10 +50 r 9 - 112 r 8 -38 r 7 -234 r 6 -23 r 5 - 135 r 4 +3 r 3 -39 r 2 +4 r-14) 
r 16 -4r 15 -20r 14 -55 r 3 +89r 12 +574r 11 + 1102 r 10 + 1467r 9 + 1029r 8 + 1248 r 7 + 144 r 6 +607 r 5 -201 r 4 +230 r 3 -74 r 2 +29 r-22 
4r (r 2 +2) (-r 14 -6 r 13 -8 r 12 -20 r 11 -67 r lu +62 r 9 + 150 r 8 +272 r 7 +257 r B + 158 r 5 + 132 r 4 +36 r 3 +35 r 2 + 10 r+14) 



corresponding to the product extrema given by the columns of the following matrix: 
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CASE 3 



For Case 3, in the vicinity of r = 1/2, and for a noise level given by 



^-IT^ (15) 



8 



the output state of C\ is given by (using the column vector notation of equation (HJ): 



1/(1 + 2r) 



/ 1 + 2r \ 
/• 
1 
1 
r 
1 

— r 
r 
1 

— r 

,.2 

1 
1 

C 
1 

V i J 



The probabilities p m appearing in the separable decomposition are given by: 



(r-1) 2 (-r 6 +r 5 + 10r 3 +7r 2 + 13r-6) 
4 (2r+l) (r+1) 2 (-r 5 +r 4 +2 r 2 +r+l) 
-2r 9 +r 8 +8r 7 -4r 6 + 12r 5 -18r 4 +32r 3 -60r 2 +46r-15 
4(2r+l) (r+1) 3 (-r 5 +r 4 +2 r 2 +r+l) 



(r-1) 2 (-r 4 +4r 2 f8r+l) 
4(2r+l) (-r 5 +r 4 +2r 2 +r+l) 
(r-1) 2 (-3r 6 +r 4 +8r 3 +r 2 + l) 
2r(2r+l) (r+1) 2 (-r 5 +r 4 +2 r 2 +r+l) 
(r-1) (-2r 5 +r 4 +4r 3 +10r 2 +2r+l) 
_ (2 r+1) (r 2 + l) (r+1) 2 (-r 3 +r 2 +r+l) 
-r'+r 5 +ll) r 4 +r 3 -4 r 2 -9 r+2 
4r(2r+l) (r+1) 2 (-r 3 +r 2 +r+l) 
^-3r 7 +2r 6 +6r 4 -7r 3 +2r-i 



r(2r+l) (r+1) 2 (-r 5 +r 4 +2 r 2 +r+l) 



i +2 r z +2 r+1 



+ 4 



2r+l 

r (-r 5 -r 4 +2r 3 +6r 2 +7r-5) 
2 (2r+l) (r 2 + l) (r+1) (-r 3 +r 2 +r+l) 
2r 8 -5r 7 +7r a -9r 5 +7r 4 -19r 3 +9r 2 +r-l 
4r 2 (2 r+1) (-r 5 +r 4 +2 r 2 +r+l) 
-2 r 9 +3 r 8 -8 r 7 + 16 r-8 r 5 +30 r 4 -24 r 3 +8 r 2 +2r-l 
4r 2 (2r+l) (r 2 + l) (r+1) (-r 3 +r 2 +r+l) 
-2 r 10 +2 r fl -3 r 8 +6 r 7 +6 r 6 + 14 r 5 +20 r 4 -30 r 3 +28 r 2 -8 r-1 
4r(2r+l) (r+1) 2 (-r 5 +r 4 +2 r 2 +r+l) 
-2r 11 +2r 10 +3r 9 +3 r 8 + 16 r 7 - 16 r 6 + 14 r 5 -58 r 4 +58 r 3 -26r 2 +7r-l 
4r (2 (r+1) 3 (-r 5 +r 4 +2 r 2 +r+l) 

1 _ 7 , 1 

r+1 4 (2 r+1) ' 4 



(16) 
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corresponding to the product extrcma given by the columns of the following matrix: 
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CASE 4 



The separable decomposition in Case 4 was guessed by hand, and hence is simpler that the expressions derived 
above using numerical and computational algebra techniques. For Case 4 and a noise level given by: 



A = 1 



1 



the output state of C\ is given by (using the column vector notation of equation (J3J): 



( 2 



r 1 1 \ 

r —r 2 1 
— 1 r r 
r 11/ 



(17) 



The separable decomposition of this state is given by the following expression, which works for all values of r (as the 
probabilities are always positive): 



(I r ll\ 
1 r 1 1 

\1 r I I J 



+ 



2 + r 2 



( 1 \ 
0-10 
0-100 

\ o o ooy 



(18) 



where the rightmost matrix is simply a uniform mixture of all product vectors (xa, VAi %a) ® (^b, J/b, zb) that satisfy 
|ajj4| = \xb\ = \va\ = \vb\ = 1, \ z a\ = \ z b\ = t, and the anticorrelation condition xa = —ys and xb = — VA (which 
ensures the presence of the central —1 elements). 



